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Program description^ 


Corrective Reading is designed to promote reading accuracy 
(decoding), fluency, and comprehension skills of students in 
third grade or higher who are reading below their grade level. 
The program has four levels that address students’ decoding 
skills and six levels that address students’ comprehension 
skills. All lessons in the program are sequenced and scripted. 


Corrective Reading can be implemented in small groups of four 
to five students or in a whole-class format. Corrective Reading 
is intended to be taught in 45-minute lessons four to five times a 
week. For the single study reviewed in this report, only the word- 
level skills components of the Corrective Reading program were 
implemented. 


Research 


One study of Corrective Reading met the What Works Clearing- 
house (WWC) evidence standards. This study included 79 third- 
grade students in Pennsylvania.^ 


The WWC considers the extent of evidence for Corrective 
Reading to be small for alphabetics, fluency, and comprehen- 
sion. No studies that met WWC evidence standards with or 
without reservations addressed general reading achievement. 



Effectiveness Corrective Reading was found to have potentially positive effects on alphabetics and fluency and no discernible effects on 
comprehension. 









General reading 


Alphabetics 


Fluency 


Comprehension 


achievement 



Rating of effectiveness 


Potentially positive 


Potentially positive 


No discernible effects na 


Improvement index^ 


Average: +9 percentile 


Average: +11 percentile 


Average: +7 percentile na 




points 


points 


points 




Range: +1 to +13 percen- 




Range: +2 to +11 percen- 




tile points 




tile points 



na = not applicable 

1. The descriptive information for this program was obtained from publicly available sources: the program’s web site ( www.sraonline.com . downloaded 
April, 2007) and the research literature (Torgesen et al., 2006). The WWC requests developers to review the program description sections for accuracy 
from their perspective. Further verification of the accuracy of the descriptive information for this program is beyond the scope of this review. 

2. The evidence presented in this report is based on available research. Findings and conclusions may change as new research becomes available. 

3. These numbers show the average and range of student-level improvement indices for ail findings in the study. 
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Developer and contact 

Corrective Reading is distributed by SRA/McGraw-Hili. 

Address: 220 East Danieidale Road Desoto, TX 75115-2490. 

Web: http://www.sraonline.com/ . Teiephone: (888) 772-4543. 

Scope of use 

No information is availabie on the number of students or schools 
using the program. 

Teaching 

The program’s 45-minute iessons are designed for groups of up 
to 20 students, ideally five times a week. It has two components, 
decoding and comprehension, each with four ieveis. The 
number of lessons varies by component and level. Decoding 
level A, designed for non-readers, has 65 lessons. It emphasizes 
basic decoding skills: rhyming, sounding out, sentence reading, 
and story reading. Levels B1 and B2 (65 lessons) are designed 
for struggling readers who do not read fluently or who confuse 
similar words. These levels teach students to become automatic 
decoders, with the increased self-confidence to read 90 words 
per minute by the end of B1 and 120 words per minute by the 



end of B2. Level C (125 lessons) is designed for students who 
experience difficulty with vocabulary and complex sentence 
structures. This level bridges the gap between advanced word 
decoding skills and the ability to read informational text. All levels 
contain ongoing mastery tests and individual reading checkouts 
to assess individual student achievement. 

Corrective Reading uses a direct instruction approach, a 
scripted presentation that uses a brisk pace, carefully chosen 
exercises and examples, and other presentation techniques. The 
publisher provides approximately seven hours of staff develop- 
ment that focuses on how to deliver direct instruction and use 
the program materials. Follow-up observations and coaching 
are recommended. A Teaching Tutor CD-Rom provides ongoing 
support for teachers using Corrective Reading. 

Cost 

Prices range by level (A, B1, B2, C) and content (decoding, 
comprehension). The cost of student materials ranges from $10 a 
student for level A programs to $50 a student for level C materi- 
als. Teacher materials cost approximately $200 per level. 



Twenty-five studies reviewed by the WWC investigated the 
effects of Corrective Reading. One study (Torgesen et al., 2006) 
was a randomized controlled trial that met WWC evidence stan- 
dards. The remaining 24 studies did not meet evidence screens. 

Torgesen et al. (2006) examined the effects of Corrective 
Reading on 79 third-grade students in eight school units® in 
Pennsylvania.® Students in the comparison group participated in 
the regular reading program at their schools. 



Extent of evidence 

The WWC categorizes the extent of evidence in each domain as 
small or moderate to large (see the What Works Clearinghouse 
Extent of Evidence Categorization Scheme) . The extent of 
evidence takes into account the number of studies and the 
total sample size across the studies that met WWC evidence 
standards with or without reservations.^ 



4. The comprehension component, which was not evaluated in studies reviewed by the WWC, is not fully described here. 

5. A school unit consists of several partnered schools so that the cluster included two third-grade and two fifth-grade instructional groups. Because of the 
age range of the beginning reading review, only data of the third graders were included in this review. 

6. For the purposes of this study, only the word-level skill components of Corrective Reading were implemented, but the study noted that the complete 
version contains instructional routines and materials that also focus on comprehension and vocabulary. 

7. The Extent of Evidence Categorization was developed to tell readers how much evidence was used to determine the intervention rating, focusing on the 
number and size of studies. Additional factors associated with a related concept, external validity, such as the students’ demographics and the types of 
settings in which studies took place, are not taken into account for the categorization. 
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Research (continued) 



Effectiveness 



The WWC found Corrective 
Reading to have potentially 
positive effects on 
alphabetics and fluency 
and no discernible effects 
on comprehension 
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The WWC considers the extent of evidence for Corrective sion. No studies that met WWC evidence standards with or 

Reading to be small for alphabetics, fluency, and comprehen- without reservations addressed general reading achievement. 



Findings 

The WWC review of interventions for beginning reading 
addresses student outcomes in four domains: alphabetics, 
fluency, comprehension, and general reading achievement.® The 
study included here covers three domains: alphabetics, fluency, 
and comprehension. Within the alphabetics domain, the study 
reported on one construct: phonics. 

Aiphabetics. Torgesen et al. (2006) examined four outcomes 
under the phonics construct of the alphabetics domain (Woodcock 
Reading Mastery Test-Revised (WRMT-R) word identification and 
word attack subtests and the Test of Word Reading Efficiency 
(TOWRE) phonetic decoding efficiency and sight word efficiency 
subtest). The authors reported statistically significant positive 
effects of Corrective Reading on two of these outcomes (WRMT-R 
word identification subtest and TOWRE sight word efficiency sub- 
test). The statistical significance of these findings was confirmed by 
the WWC. The average effect size across the four outcomes was 
neither statistically significant nor large enough to be considered 
substantively important (that is, an effect size greater than 0.25). 

Fiuency. Torgesen et al. (2006) examined one outcome in this 
domain (the Oral Reading Fluency test) and reported statistically 



significant positive effects on this outcome. The WWC analysis 
confirmed the statistical significance of the finding. 

Comprehension. Torgesen et al. (2006) examined two 
outcomes in this domain (WRMT-R passage comprehension 
subtest and the Group Reading Assessment and Diagnostic 
Evaluation (GRADE) passage comprehension subtest) and 
reported no statistically significant effects. The average effect 
size across the two outcomes was neither statistically significant 
nor large enough to be considered substantively important 
according to WWC criteria. 

Rating of effectiveness 

The WWC rates the effects of an intervention in a given outcome 
domain as positive, potentially positive, mixed, no discernible 
effects, potentially negative, or negative. The rating of effective- 
ness takes into account four factors: the quality of the research 
design, the statistical significance of the findings,® the size of 
the difference between participants in the intervention and the 
comparison conditions, and the consistency in findings across 
studies (see the WWC Intervention Rating Scheme) . 



Improvement index 

The WWC computes an improvement index for each individual 
finding. In addition, within each outcome domain, the WWC 
computes an average improvement index for each study and an 
average improvement index across studies (see Technical Details 
of WWC-Conducted Computations^ . The improvement index rep- 
resents the difference between the percentile rank of the average 



student in the intervention condition versus the percentile rank of 
the average student in the comparison condition. Unlike the rating 
of effectiveness, the improvement index is based entirely on the 
size of the effect, regardless of the statistical significance of the 
effect, the study design, or the analyses. The Improvement index 
can take on values between -50 and +50, with positive numbers 
denoting results favorable to the intervention group. 



8. For definitions of the domains, see the Beginning Reading Protocol . 

9. The level of statistical significance was reported by the study authors or, where necessary, calculated by the WWC to correct for clustering within class- 
rooms or schools and for multiple comparisons. For an explanation, see the WWC Tutorial on Mismatch . See the Technical Detaiis of WWC-Conducted 
Computations for the formulas the WWC used to calculate the statistical significance. In the case of Corrective Reading, no corrections for clustering or 
multiple comparisons were needed. 
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The WWC found Corrective 
Reading to have potentially 
positive effects on 
alphabetics and fluency and 
no discernible effects on 
comprehension (continued) 



The average improvement index for alphabetics is +9 percentile 
points across all findings in the single study, with a range of +1 to 
+13 percentile points. The improvement index for the single out- 
come for fluency is +11 percentile points. The average improvement 
index for comprehension is +7 percentile points across all findings 
in the single study, with a range of +2 to +11 percentile points. 



Summary 

The WWC reviewed 25 studies on Corrective Reading. One study 
met the WWC evidence standards.''° Based on this one study, 
the WWC found potentially positive effects in the alphabetics 
and fluency domains and no discernible effects in the compre- 
hension domain. The evidence presented in this report may 
change as new research emerges. 
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Appendix 



Appendix A1 


Study characteristics: Torgesen et ai., 2006 (randomized controiied triai) 


Characteristic 


Description 


Study citation 


Torgesen, J., Myers, D., Schirm, A., Stuart, E., Vartivarian, S., Mansfield, W., et al. (2006). National assessment of Title 1 interim report — Volume II: Closing the reading gap: 
First year findings from a randomized trial of four reading interventions for striving readers. Retrieved from Institute of Education Sciences, U.S. Department of Education Web 
site: httD://www.ed.aov/rschstat/eval/disadv/title1interimreDort/index.html 


Participants 


The study design was based on random assignment of 37 school unifs^ to one of four interventions, Corrective Reading, Kaplan SpellRead, Failure Free Reading, or Wilson 
Reading. Within each school, students were randomly assigned to the intervention or to the comparison condition. This report focuses on eight school units assigned to Cor- 
rective Reading.^ At the time of analysis, the sample included 79 third-grade students (44 in intervention and 35 in comparison groups). The number of students at baseline 
was not reported.^ About 44% of the intervention group students were female, compared with 52% in the comparison group. About 42% of fhe intervention group students 
were eligible for free/reduced lunch program and 48% in the comparison group. 


Setting 


Eight school units in Pennsylvania. 


Intervention 


Corrective Reading ms implemented by nine teachers beginning in the first week in November 2003 through the first weeks in May 2004. The intervention was administered 
to groups of three students with heterogeneous basic reading skills. The average skills of each three-student instructional group determined the pace of instruction. 
Implementation fidelity was determined by reading program trainers who observed the teachers and coached them over a period of months, by project coordinators who 
observed a sample of instructional sessions, and by ratings based on a sample of videotaped sessions. Implementation was rated as acceptable. Eor purposes of this study, 
only the word-level skills components were implemented. However, the complete program of Corrective Reading also contains instructional routines and materials that focus on 
comprehension and vocabulary. 


Comparison 


The comparison group students received their regular reading instruction, which included typical classroom instruction and, in many cases, other services (such as another 
pull-out program). The comparison group students had fewer small-group instructional hours than the intervention group students, but more one-on-one instructional hours. 


Primary outcomes 
and measurement 


The primary outcome measures in the alphabetics domain were the word identification and word attack subtests of the Woodcock Reading Mastery Test-Revised (WRMT-R) 
and the phonetic decoding efficiency subtest of the Test of Word Reading Efficiency (TOWRE). The primary measures in the fluency domain were the TOWRE sight word 
efficiency subtest and the Oral Reading Fluency test. The primary measures in the comprehension domain were the passage comprehension subtests of WRMT-R and the 
Group Reading Assessment and Diagnostic Evaluation (GRADE). (See Appendix A2.1-2.3 for more detailed descriptions of outcome measures.) 


Teacher training 


Professional development included training and coaching by reading program staff, independent study of program materials, and telephone conferences. On average, interven- 
tion group teachers participated in 70.8 professional development hours across all phases of the study (initial training phase, practice phase, and implementation phase). 



1 . A school unit consists of several partnered schools so that the cluster included two third-grade and two fifth-grade instructional groups. 

2. Findings on Kaplan SpellRead, Failure Free Reading, and Wilson Reading are included in other WWC beginning reading reports. 

3. The study reported that 10 students in the intervention group were lost to analysis. However, it is not clear whether those students were in third grade or were part of an additional sample of 
fifth-grade students that was also examined in this study. The fifth-grade sample that was included in this study is not reviewed in this report because it is outside the scope of the review. For 
sample relevancy criteria please see the Beginning Reading Protocol . 
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Appendix A2.1 Outcome measures in the aiphabetics domain 



Outcome measure 


Description 


Phonics 

Test of Word Reading 
Efficiency (TOWRE): Phonetic 
Decoding Efficiency subtest 


The TOWRE is a standardized measure. The phonetic decoding efficiency subtest measures the number of pronounceabie printed nonwords that can be accurateiy decoded 
within 45 seconds (as cited in Torgesen et ai., 2006). 


TOWRE; Sight Word 
Efficiency subtest 


The TOWRE is a standardized measure. The sight word efficiency subtest assesses the number of reai printed words that can be accurateiy identified wifhin 45 seconds (as 
cited in Torgesen et ai., 2006). 


Woodcock Reading Mastery 
Test-Revised (WRMT-R): 
Word Identification subtest 


The word identification subtest is a test of decoding skiiis. The standardized test requires the chiid to read aioud isoiated reai words that range in frequency and difficuity (as 
cited in Torgesen et ai., 2006). 


WRMT-R; Word 
Attack subtest 


This standardized test measures phonemic decoding skiiis by asking students to read pseudowords. Students are aware that the words are not reai (as cited in Torgesen et ai., 
2006). 



Appendix A2.2 Outcome measure in the fluency domain 



Outcome measure 


Description 


Edformation Oral 
Fluency Assessment 


This test measures the number of words correct per minute (WCPM) that students read using three brief grade-ievei passages (AiMSweb, as cited in Torgesen et ai., 2006). 
These passages inciude both fiction and nonfiction text. The norms for fhis test are updated by Edformation each schooi year. 



Appendix A2.3 Outcome measures in the comprehension domain 



Outcome measure 


Description 


Reading comprehension 

Group Reading Assessment 
and Diagnostic Evaluation 
(GRADE); Passage 
Comprehension subtest 


The GRADE is an untimed norm-referenced sfandardized fest. The passage comprehension subtesf inciudes a passage of text and corresponding muitipie-choice comprehen- 
sion questions (as cited in Torgesen et ai., 2006). 


WRMT-R; Passage 
Comprehension subtest 


in this standardized test, comprehension is measured by having students fiii in missing words in a short paragraph (as cited in Torgesen et ai., 2006). 
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Appendix A3.1 



Summary of study findings inciuded in the rating for the aiphahetics domain^ 









Authors’ findings from the study 














Mean outcome 
(standard deviation^) 




WWC calculations 




Outcome measure 


Study 

sample 


Sample size 
(school units/ 
students) 


Corrective 

Reading Comparison 

group group 


Mean difference^ 
{Corrective 
Reading - 
comparison) 


Statistical 
significance^ 
Effect size^ (at a = 0.05) 


Improvement 

index^ 









Torgesen et al., 2006 (randomized controlled trial)^ 








TOWRE: Phonetic Decoding 
Efficiency subtest 


Grade 3 


8/79 


89.86 

(15.00) 


89.48 

(15.00) 


0.38 


0.02 


ns 


+1 


TOWRE: Sight Word 
Efficiency subtest 


Grade 3 


8/79 


90.98 

(15.00) 


86.41 

(15.00) 


4.57 


0.30 


Statistically 

significant 


+12 


WRMT-R: Word 
Identification subtest 


Grade 3 


8/79 


91.06 

(15.00) 


87.77 

(15.00) 


3.30 


0.22 


Statistically 

significant 


+9 


WRMT-R: Word Attack subtest 


Grade 3 


8/79 


100.34 


95.15 


5.18 


0.34 


ns 


+13 



(15.00) (15.00) 

Domain average^ for alphabetics 0.22 na +9 



ns = not statistically significant 

na = not applicable 

1 . This appendix reports findings considered for fhe effecfiveness rafing and the average improvement indices. The study also included subgroup analyses by initial skill level (WRMT-R word attack subtest and Peabody Picture Vocabulary 
Test (PPVT)) and socioeconomic status. The study found no differences for alphabetics outcomes as a function of students’ initial level of word attack scores. The study reported that Corrective Reading had a statistically significant 
positive effect on alphabetics for students with high pretest PPVT scores and no effects on students with low pretest PPVT scores. An additional subgroup analysis found statistically significant positive effects on the TOWRE sight word 
efficiency subtest for students not eligible for the free/reduced lunch program, but not for students eligible for the free/reduced lunch program. 

2. The standard deviation across all students in each group shows how dispersed the participants’ outcomes are: a smaller standard deviation on a given measure would indicate that participants had more similar outcomes. The standard 
deviations in Torgesen et al. (2006) were the population standard deviations for these standardized outcomes. 

3. Positive differences and effect sizes favor the intervention group; negative differences and effect sizes favor the comparison group. The intervention group mean is the comparison group mean plus the mean difference. 

4. For an explanation of the effect size calculation, see Technical Details of WWC-Conducted Computations . 

5. Statistical significance is the probability that the difference between groups is a result of chance rather than a real difference between the groups. 

6. The improvement index represents the difference between the percentile rank of the average student in the intervention condition versus the percentile rank of the average student in the comparison condition. The improvement index 
can take on values between -50 and +50, with positive numbers denoting results favorable to the intervention group. 

7. The level of statistical significance was reported by the study authors or, where necessary, calculated by the WWC to correct for clustering within classrooms or schools and for multiple comparisons. For an explanation about the clus- 
tering correction, see the WWC Tutorial on Mismatch . See Technical Details of WWC-Conducted Computations tor the formulas the WWC used to calculate statistical significance. In the case of Torgesen et al. (2006) and the alphabetics 
domain, corrections for multiple comparisons were needed because the study’s reported corrections for multiple comparisons were based on a grouping of outcomes that differed from the groupings of domains for this review. 

8. This row provides the study average, which in this instance is also the domain average. The WWC-computed domain average effect size is a simple average rounded to two decimal places. The domain improvement index is calculated 
from the average effect size. 
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Appendix A3.2 Summary of study findings inciuded in the rating for the fluency domain^ 









Authors’ findings from the study 














Mean outcome 
(standard deviation^) 




WWC calculations 




Outcome measure 


Study 

sample 


Sample size 
(school units/ 
students^) 


Corrective 

Reading Comparison 

group group 


Mean difference'* 
{Corrective 
Reading - 
comparison) 


Statistical 
significance^ 
Effect size® (at a = 0.05) 


Improvement 

index^ 









Torgesen et al., 2006 (randomized controlled trial)^ 








Oral Reading Fluency 


Grade 3 


8/79 


66.04 55.33 10.71 

(39.20) (39.20) 


0.27 


Statistically 

significant 


-Fll 


Domain average® for fluency 








0.27 


Statistically 

significant 


-Fll 



1 . This appendix reports findings considered for the effectiveness rating and the average improvement indices. The study aiso inciuded subgroup anaiyses by initiai skiii ievei (WRMT-R word attack subtest and Peabody Picture Vocabuiary 
Test (PPVT)) and socioeconomic status. The study found statisticaiiy significant positive effects on the Orai Fiuency Test for students not eiigibie for the free/reduced iunch program, but not for students eiigibie for the free/reduced iunch 
program. No differences were found between initiai skiii ievei subgroups of students for the fiuency outcome. 

2. The standard deviation across aii students in each group shows how dispersed the participants’ outcomes are: a smaiier standard deviation on a given measure wouid indicate that participants had more simiiar outcomes. The standard 
deviations in Torgesen et ai. (2006) were the popuiation standard deviations for these standardized outcomes. 

3. The sampie size for the anaiysis was not reported in the study. The sampie size reported is the totai number of third-grade students in the intervention and controi conditions at baseiine, which may differ from the actuai number of 
students used in the anaiysis in the report. 

4. Positive differences and effect sizes favor the intervention group; negative differences and effect sizes favor the comparison group. The intervention group mean is the comparison group mean pius the mean difference. 

5. For an expianation of the effect size caicuiation, see Technicai Detaiis of WWC-Conducted Computations . 

6. Statisticai significance is the probabiiity that the difference between groups is a resuit of chance rather than a reai difference between the groups. 

7. The improvement index represents the difference between the percentiie rank of the average student in the intervention condition versus the percentiie rank of the average student in the comparison condition. The improvement index 
can take on vaiues between -50 and -i-50, with positive numbers denoting resuits favorabie to the intervention group. 

8. The ievei of statisticai significance was reported by the study authors or, where necessary, caicuiated by the WWC to correct for ciustering within ciassrooms or schoois and for muitipie comparisons. For an expianation about the 
ciustering correction, see the WWC Tutoriai on Mismatch . See Technicai Detaiis of WWC-Conducted Computations for the formuias the WWC used to caicuiate statisticai significance, in the case of Torgesen et ai. (2006) and the fiuency 
domain, no corrections for ciustering were needed because students were assigned to conditions. No corrections for muitipie comparisons were needed because there is oniy one outcome in this domain. 

9. This row provides the domain average, which in this instance is aiso the singie study finding. 
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Appendix A3.3 Summary of study findings inciuded in the rating for the comprehension domain^ 









Authors’ findings from the study 














Mean outcome 
(standard deviation^) 




WWC calculations 




Outcome measure 


Study 

sample 


Sample size 
(school units/ 
students^) 


Corrective 

Reading Comparison 

group group 


Mean difference'* 
{Corrective 
Reading - 
comparison) 


Statistical 
significance^ 
Effect size® (at a = 0.05) 


Improvement 

index^ 



Torgesen et al., 2006 (randomized controlled trial)^ 



WRMT: Passage 


Grade 3 


8/79 


93.16 


92.30 


0.86 


0.06 


ns 


-f2 


Comprehension subtest 






(15.00) 


(15.00) 










GRADE: Passage 


Grade 3 


8/79 


87.39 


83.22 


4.18 


0.28 


ns 


-fII 


Comprehension subtest 






(15.00) 


(15.00) 











Domain average^ for comprehension 0.17 ns +7 



ns = not statistically significant 

1 . This appendix reports findings considered for the effectiveness rating and the average improvement indices. The study also included subgroup analyses by initial skill level (WRMT-R word attack subtest and Peabody Picture Vocabulary 
Test (PPVT)) and socioeconomic status. No differences were found between subgroups of students for outcomes in the comprehension domain. 

2. The standard deviation across all students in each group shows how dispersed the participants’ outcomes are: a smaller standard deviation on a given measure would indicate that participants had more similar outcomes. The standard 
deviations in the Torgesen et al. (2006) study were the population standard deviations for these standardized outcomes. 

3. The sample size for the analysis was not reported in the study. The sample size reported is the total number of third-grade students in the intervention and control conditions at baseline, which may differ from the actual number of 
students used in the various analysis in the report. 

4. Positive differences and effect sizes favor the intervention group; negative differences and effect sizes favor the comparison group. The intervention group mean is the comparison group mean plus the mean difference. 

5. For an explanation of the effect size calculation, see Technical Details of WWC-Conducted Computations . 

6. Statistical significance is the probability that the difference between groups is a result of chance rather than a real difference between the groups. 

7. The improvement index represents the difference between the percentile rank of the average student in the intervention condition versus the percentile rank of the average student in the comparison condition. The improvement index 
can take on values between -50 and -i-50, with positive numbers denoting results favorable to the intervention group. 

8. The level of statistical significance was reported by the study authors or, where necessary, calculated by the WWC to correct for clustering within classrooms or schools and for multiple comparisons. For an explanation about the 
clustering correction, see the WWC Tutorial on Mismatch . See Technical Details of WWC-Conducted Computations for the formulas the WWC used to calculate statistical significance. In the case of Torgesen et al. (2006) and the com- 
prehension domain, no corrections for clustering were needed. No corrections for multiple comparisons were needed because the study’s reported corrections for multiple comparisons were based on the same group of outcomes as the 
domain for this review. 

9. This row provides the domain average, which in this instance is also the study average. The WWC-computed domain average effect size is a simple average rounded to two decimal places. The domain improvement index is calculated 
from the average effect size. 
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Appendix A4.1 Corrective Reading rating for the aiphabetics domain 



The WWC rates an intervention’s effects in a given outcome domain as positive, potentiaily positive, mixed, no discernibie effects, potentialiy negative, or negative.'' 

For the outcome domain of aiphabetics, the WWC rated Corrective Reading as potentially positive. It did not meet the criteria for positive effects because only 
one study showed a statistically significant positive effect. The remaining ratings (mixed, no discernible effects, potentially negative, or negative) were not considered 
because Corrective Reading was assigned the highest applicable rating. 

Rating received 

Potentially positive effects: Evidence of a positive effect with no overriding contrary evidence. 

• Criterion 1: At least one study showing a statistically significant or substantively important positive effect. 

Met. One study showed a statistically significant positive effect. 

AND 

• Criterion 2: No studies showing a statistically significant or substantively important negative effect and fewer or the same number of studies showing indeterminate 
effects than showing statistically significant or substantively important positive effects. 

Met. No studies showed a statistically significant or substantively important negative effect. The single study that met the WWC standards showed 
a statistically significant positive effect. 

Other ratings considered 

Positive effects: Strong evidence of a positive effect with no overriding contrary evidence. 

• Criterion 1: Two or more studies showing statistically significant positive effects, at least one of which met WWC evidence standards for a strong design. 

Not met. Only one study showed a statistically significant positive effect. 

AND 

• Criterion 2: No studies showing statistically significant or substantively important negative effects. 

Met. No studies showed statistically significant or substantively important negative effects. 

1. For rating purposes, the WWC considers the statistical significance of individual outcomes and the domain-level effect. The WWC also considers the size of the domain-level effect for ratings of 
potentially positive or potentially negative effects. See the WWC Intervention Rating Scheme for a complete description. 
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Appendix A4.2 Corrective Reading rating for the fluency domain 



The WWC rates an intervention’s effects in a given outcome domain as positive, potentiaily positive, mixed, no discernibie effects, potentialiy negative, or negative.'' 

For the outcome domain of fluency, the WWC rated Corrective Reading as potentialiy positive. It did not meet the criteria for positive effects because only one study 
showed statistically significant positive effects. The remaining ratings (mixed, no discernibie effects, potentialiy negative, or negative) were not considered because Cor- 
rective Reading was assigned the highest appiicabie rating. 

Rating received 

Potentially positive effects: Evidence of a positive effect with no overriding contrary evidence. 

• Criterion 1: At ieast one study showing a statisticaiiy significant or substantively important positive effect. 

Met. One study showed a statistically significant positive effect. 

AND 

• Criterion 2: No studies showing a statistically significant or substantively important negative effect and fewer or the same number of studies showing indeterminate 
effects than showing statisticaiiy significant or substantively important positive effects. 

Met. No studies showed a statistically significant or substantively important negative effect. The single study that met the WWC standards showed 
a statistically significant positive effect. 

Other ratings considered 

Positive effects: Strong evidence of a positive effect with no overriding contrary evidence. 

• Criterion 1: Two or more studies showing statistically significant positive effects, at least one of which met WWC evidence standards for a strong design. 

Not met. Only one study showed a statistically significant positive effect. 

AND 

• Criterion 2: No studies showing statistically significant or substantively important negative effects. 

Met. No studies showed statistically significant or substantively important negative effects. 

1. For rating purposes, the WWC considers the statistical significance of individual outcomes and the domain-level effect. The WWC also considers the size of the domain-level effect for ratings of 
potentially positive or potentially negative effects. See the WWC Intervention Rating Scheme for a complete description. 
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Appendix A4.3 Corrective Reading rating for the comprehension domain 



The WWC rates an intervention’s effects in a given outcome domain as positive, potentiaily positive, mixed, no discernibie effects, potentialiy negative, or negative.'' 

For the outcome domain of comprehension, the WWC rated Corrective Reading as having no discernible effects. It did not meet the criteria for other ratings (positive, 
potentially positive, mixed, potentially negative, and negative) because the single study that met WWC standards did not show statistically significant or substantiveiy 
important effects. 

Rating received 

No discernible effects: No affirmative evidence of effects. 

• Criterion 1: None of the studies shows a statisticaiiy significant or substantively important effect, either pos/f/Ve or negative. 

Met. No studies showed statisticaiiy significant or substantively important positive or negative effects. 

Other ratings considered 

Positive effects: Strong evidence of a positive effect with no overriding contrary evidence. 

• Criterion 1: Two or more studies showing statisticaiiy significant positive effects, at least one of which met WWC evidence standards for a strong design. 

Not met. No studies showed statisticaiiy significant positive effects. 

AND 

• Criterion 2: No studies showing statisticaiiy significant or substantively important negative effects. 

Met. No studies showed statisticaiiy significant or substantively important negative effects. 

Potentially positive effects: Evidence of a positive effect with no overriding contrary evidence. 

• Criterion 1: At ieast one study showing a statisticaiiy significant or substantively important positive effect. 

Not met. No studies showed statisticaiiy significant or substantively important positive effects. 

AND 

• Criterion 2: No studies showing a statistically significant or substantively important negative effect and fewer or the same number of studies showing indeterminate 
effects than showing statisticaiiy significant or substantively important positive effects. 

Not met. The single study that met WWC standards showed indeterminate effects. 

Mixed effects: Evidence of inconsistent effects as demonstrated through either of the foiiowing criteria. 

• Criterion 1: At ieast one study showing a statisticaiiy significant or substantively important positive effect, and at least one study showing a statistically significant 
or substantively important negative effect, but no more such studies than the number showing a statistically significant or substantively important positive effect. 

Not met. No studies showed statistically significant or substantively important effects, either positive or negative. 

OR 

• Criterion 2: At least one study showing a statistically significant or substantively important effect, and more studies showing an indeterminate effect than showing a 
statistically significant or substantively important effect. 

Not met. No studies showed statistically significant or substantively important effects, either positive or negative. 

(continued) 
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Appendix A4.3 Corrective Reading rating for the comprehension domain (continued) 



Potentially negative effects: Evidence of a negative effect with no overriding contrary evidence. 

• Criterion 1: At ieast one study showing a statisticaiiy significant or substantively important negative effect. 

Not met. No studies showed statisticaiiy significant or substantively important negative effects. 

AND 

• Criterion 2: No studies showing a statistically significant or substantively important positive effect, or more studies showing statisticaiiy significant or substantively 
important negative effects than showing statistically significant or substantively important pos/'f/Ve effects. 

Met. No studies showed statistically significant or substantively important positive effects. In addition, no studies showed a statistically significant 
or substantively important negative effect. 

Negative effects: Strong evidence of a negative effect with no overriding contrary evidence. 

• Criterion 1: Two or more studies showing statistically significant negative effects, at least one of which met WWC evidence standards for a strong design. 

Not met. No studies showed statistically significant negative effects. 

AND 

• Criterion 2: No studies showing statistically significant or substantively important positive effects. 

Met. No studies showed statistically significant or substantively important positive effects. 

1 . For rating purposes, the WWC considers the statistical significance of individual outcomes and the domain-level effect. The WWC also considers the size of the domain-level effect for ratings of 
potentially positive or potentially negative effects. See the WWC Intervention Rating Scheme for a complete description. 
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Appendix A5 Extent of evidence by domain 



Outcome domain 


Number of studies 


Sample size 

School units 


Students 


Extent of evidence^ 


Alphabetics 


1 


8 


More than 70 


Small 


Fluency 


1 


8 


More than 70 


Small 


Comprehension 


1 


8 


More than 70 


Small 


General reading achievement 


0 


0 


0 


na 



na = not applicable/not studied 

1 . A rating of “moderate to large” requires at least two studies and two schools across studies in one domain, and a total sample size across studies of at least 350 students or 14 classrooms. 
Otherwise, the rating is “small.” 
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